DS-Net: Dynamic spatiotemporal network for video salient object detection

نویسندگان

چکیده

As moving objects always draw more attention of human eyes, the temporal motion information is exploited complementarily with spatial to detect salient in videos. Although efficient tools such as optical flow have been proposed extract information, it often encounters difficulties when used for saliency detection due movement camera or partial objects. In this paper, we investigate complementary roles and propose a novel dynamic spatiotemporal network (DS-Net) effective fusion information. We construct symmetric two-bypass explicitly features. A weight generator (DWG) designed automatically learn reliability corresponding branch. And top-down cross attentive aggregation (CAA) procedure facilitate Finally, features are modified by guidance coarse map then go through decoder part final map. Experimental results on five benchmarks VOS, DAVIS, FBMS, SegV2, ViSal demonstrate that method achieves superior performance than state-of-the-art algorithms. The source code available at https://github.com/TJUMMG/DS-Net.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Video Salient Object Detection Using Spatiotemporal Deep Features

This paper presents a method for detecting salient objects in videos where temporal information in addition to spatial information is fully taken into account. Following recent reports on the advantage of deep features over conventional handcrafted features, we propose the SpatioTemporal Deep (STD) feature that utilizes local and global contexts over frames. We also propose the SpatioTemporal C...

متن کامل

Salient Region Detection in Video Using Spatiotemporal Visual Attention Model

Abstract Salient region detection is very useful in video analysis. A salient region detection method based on spatiotemporal visual attention model is proposed in this paper. Visual attention mechanism is used to generate saliency map of the image sequence. Spatial saliency map is computed in accordance with some predefined features including intensity, color and orientation. Temporal visual s...

متن کامل

MSDNN: Multi-Scale Deep Neural Network for Salient Object Detection

Salient object detection is a fundamental problem and has been received a great deal of attentions in computer vision. Recently deep learning model became a powerful tool for image feature extraction. In this paper, we propose a multi-scale deep neural network (MSDNN) for salient object detection. The proposed model first extracts global high-level features and context information over the whol...

متن کامل

Efficient Co-Salient Video Object Detection Based on Preattentive Processing

Automatic video annotation is a critical step for contentbased video retrieval and browsing. Detecting the focus of interest such as co-occurring objects in video frames automatically can benefit the tedious manual labeling process. However, detecting the co-occurring objects that is visually salient in video sequences is a challenging task. In this paper, in order to detect co-salient video ob...

متن کامل

Impression Network for Video Object Detection

Video object detection is more challenging compared to image object detection. Previous works proved that applying object detector frame by frame is not only slow but also inaccurate. Visual clues get weakened by defocus and motion blur, causing failure on corresponding frames. Multiframe feature fusion methods proved effective in improving the accuracy, but they dramatically sacrifice the spee...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Digital Signal Processing

سال: 2022

ISSN: ['1051-2004', '1095-4333']

DOI: https://doi.org/10.1016/j.dsp.2022.103700